[livres divers classés par sujet] [Informatique] [Algorithmique] [Programmation] [Mathématiques] [Hardware] [Robotique] [Langage] [Intelligence artificielle] [Réseaux]
[Bases de données] [Télécommunications] [Chimie] [Médecine] [Astronomie] [Astrophysique] [Films scientifiques] [Histoire] [Géographie] [Littérature]

Frequent Itemset Discovery with SQL Using Universal Quantification

contributor Anwendersoftware (IPVR)
creator Rantzau, Ralf
date 2002-03
description Algorithms for finding frequent itemsets fall into two broad classes: (1) algorithms that are based on non-trivial SQL statements to query and update a database, and (2) algorithms that employ sophisticated in-memory data structures, where the data is stored into and retrieved from flat files. Most performance experiments have shown that SQL-based approaches are inferior to main-memory algorithms. However, the current trend of database vendors to integrate analysis functionalities into their query execution and optimization components, i.e., "closer to the data," suggests revisiting these results and searching for new, potentially better solutions. We investigate approaches based on SQL-92 and present a new approach called Quiver that employs universal and existential quantifications. This approach uses a table layout for itemsets, where a group of multiple records represents a single itemset. Hence, our vertical layout is similar to the popular layout used for the transaction table, which is the input of frequent itemset discovery. Our approach is particularly beneficial if the database system in use provides adequate strategies and techniques for processing universally quantified queries, unlike current commercial systems.
format application/pdf
identifier  http://www.informatik.uni-stuttgart.de/cgi-bin/NCSTRL/NCSTRL_view.pl?id=INPROC-2002-05&engl=1
language eng
publisher Prague, Czech Republic: unknown
source In: Proceedings of the International Workshop on Database Technologies for Data Mining (DTDM); Prague, Czech Republic, March 2002, pp. 51-66
ftp://ftp.informatik.uni-stuttgart.de/pub/library/ncstrl.ustuttgart_fi/INPROC-2002-05/INPROC-2002-05.pdf
subject Database Management Systems (CR H.2.4)
Database Applications (CR H.2.8)
data mining
association rules
relational division
mining and database integration
title Frequent Itemset Discovery with SQL Using Universal Quantification
type Text
Article in Proceedings